O 



Tight bounds on the maximum size of a set of 
permutations with bounded VC-dimension * 

Josef Cibulka^, Jan Kyncl^ 



^ Department of Applied Mathematics, 



■ Charles University, Faculty of Mathematics and Physics, 

Malostranske nam. 25, 118 00 Praha 1, Czech Republic; 
^ l' cibulka@kam.mff.cuni.cz 

< 

^ Department of Applied Mathematics and Institute for Theoretical Computer Science, 



Charles University, Faculty of Mathematics and Physics, 
Malostranske nam. 25, 118 00 Praha 1, Czech Republic; 

kynclSkam.mf f . cuni . cz 



Abstract 



o 
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■ The VC-dimension of a family V of n-permutations is the largest integer k 

such that the set of restrictions of the permutations in V on some /j-tuple of 
positions is the set of all k\ permutation patterns. Let rfc(n) be the maximum size 
• of a set of n-permutations with VC-dimension k. Raz showed that r2(n) grows 

J> . exponentially in n. We show that r3(n) = 2®("'^°sa(")) ^nd for every t > 1, we 

^ ; have r2t+2{n) = 2® and r2t+3{n) = 20(""(")*i°s"W). 

' We also study the maximum number Pk{n) of 1-entries in an n x n (0, 1)- 

•/^ ■ matrix with no {k + l)-tuple of columns containing all {k + l)-permutation ma- 

■rj" ! trices. We determine that, for example, P3{n) = Q{na{n)) and p2t+2('i-) = 

O ■ „2(Vt!)"W*±0{°{n)'-i) for every t > 1. 

We also show that for every positive s there is a slowly growing function Csin) 
(for example C2t+3(n) = 2<^ for every t > 1) satisfying the following. For all 

positive integers n and B and every n x n (0, l)-matrix M with 1-entries, 
the rows of M can be partitioned into s intervals so that at least B columns contain 



X 

. at least B 1-entries in each of the intervals. 
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1 Introduction 



Let T be a set system on [n] = {1,2, ... ,n}. We say that a set iiT C [n] is shattered by T 
if every subset of K appears as an intersection of K and some set from T. The Vapnik- 
Chervonenkis dimension (VC- dimension) of T is the size of the largest set shattered by 
T. Sauer's lemma gives the exact value of the maximum size of a set system on [n] with 
VC-dimension k, which is a polynomial in n of degree k. More on the VC-dimension 
and its history can be found for example in [T6] . 

Motivated by the so-called acyclic linear orders problem, Raz defined the VC- 
dimension of a set V of permutations: Let Sn be the set of all n-permutations, that 
is, permutations of [n]. The restriction of tt & Sn to the k-tuple (oi, 02, . . . , a^) of 
positions (where 1 < ai < 02 < ■ ■ • < < n) is the fc-permutation vr' satisfying 
\/i,i : n'ii) < vr'(j) vr(aj) < 7r(aj). The /c-tuple of positions (oi, . . . ,0^) is shattered 
by V if each A;-permutation appears as a restriction of some vr G P to (ai, . . . , a^). The 
VC-dimension of V is the size of the largest set of positions shattered by V. Let r/c(n) 
be the size of the largest set of n-permutations with VC-dimension k. 

Raz [23] proved that r2(n) < for some constant C and asked whether an expo- 
nential upper bound on rfc(n) can also be found for every /c > 3. 

An n-permutation vr avoids a fc-permutation p if none of the restrictions of tt to a k- 
tuple of positions is p. Clearly, the set of permutations avoiding p E Sk has VC-dimension 
smaller than k. Thus, Raz's question generalizes the Stanley-Wilf conjecture which 
states that the number of n-permutations that avoid an arbitrary fixed permutation p 
grows exponentially in n. The conjecture was settled by Marcus and Tardos [15] using 
a result of Klazar |I2] . 

We show in Section [2] that the size of a set of n-permutations with VC-dimension k 
cannot be much larger than exponential in n. The result has an application in enumer- 
ating simple complete topological graphs [H]. Let a{n) be the inverse of the Ackermann 
function; see Section 12.21 for its definition. 

Theorem 1.1. The sizes of sets of permutations with bounded VC-dimension satisfy 

r4(n) < 2"-(2°(")+3iog2(aW)+o(i))^ 
r2t+2in) < 2"-«2/*!)«(")*+o(«W*-^)) j^r every t > 2 and 

r2t+3{n) < 2"-((2/*0"W* log.,{a{n))+0{a{ny) ^^^^^ ^ y 

On the other hand, we give a negative answer to Raz's question in Section [31 
Theorem 1.2. We have 

rsH > (a(n)/2 -0(1))" and 
r2t+3{n) > r2t+2{n) > 2"-((iA!)«W*-OK")*-^)) y^r every t > 1. 

An n-permutation matrix is an n x n (0, l)-matrix with exactly one 1-entry in every 
row and every column. Permutations and permutation matrices are in a one-to-one 
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correspondence that assigns to a permutation tt a permutation matrix with ATr{i,j) = 
1 ^ 7r(j) = i 

An m X n (0, l)-matrix i? contains a /c x / (0, l)-matrix 5* if has a k x I submatrix 
T that can be obtained from S by changing some (possibly none) 0-entries to 1-entries. 
Otherwise B avoids S. Thus, a permutation vr avoids p if and only if avoids Ap. 
Fiiredi and Hajnal [7] studied the following problems from the extremal theory of (0, 1)- 
matrices. Given a matrix S (the forbidden matrix), what is the maximum number 
exs{n) of 1-entries in an n x n matrix that avoids S"? This area is closely related to 
Turan problems on graphs and to Davenport-Schinzel sequences. Functions exs or their 
asymptotics have been determined for some matrices S [71 [20l [25] and these results have 
found applications mostly in discrete geometry [2], El El [IB] and also in the analysis of 
algorithms [TH]. The Fiiredi-Hajnal conjecture states that exp(n) is linear in n whenever 
P is a permutation matrix. Marcus and Tardos proved this conjecture by a surprisingly 
simple argument [15]. This implied the relatively long standing Stanley- Wilf conjecture 
by Klazar's reduction [12j. An improved reduction yielding the upper bound 2'^('^^°sfc)"' 
on the size of a set of n-permutations with a forbidden /c-permutation was found by the 
first author [3]. 

We modify the question of Fiiredi and Hajnal and study the maximal number Pk{n) 
of 1-entries in an n x matrix such that no {k + l)-tuple of columns contains all (fc + 1)- 
permutation matrices. It can be easily shown that p2{n) = 4n — 4. Indeed, consider an 
nxn matrix with at least An — 3 1-entries. Remove the highest and the lowest 1-entry in 
every column. Then the first and the last row of the resulting matrix contain no 1-entry 
and thus one of the rows contains three 1-entries. The three columns of the original 
matrix containing these 1-entries contain every 3-permutation matrix. The lower bound 
An — A can be achieved for example by filling the two top rows and some two columns 
with I's. 

Theorem 1.3. We have 

2na{n) — 0{n) < Pz{n) < 0{na{n)), 

P2t+2{n) = n2(i/*')"(")'^°("(")'"') for every t > 1 and 
^2(iA!)"(n)'-o("W*-') < p2t+3(n) < ^2(^/*')"(")'^°*52(«(«))+o("W*) for ^very t>l. 

The upper bounds from Theorem 11.31 are proven as Corollary 12.41 in Section 12.11 and 
the lower bounds as Corollary 13.71 in Section 13. 2[ 

Let 5* and T be sequences. We say that 5* contains a pattern T if 5* contains a 
subsequence T' isomorphic to T, that is, T can be obtained from T' by a one-to-one 
renaming of the symbols. A sequence S over an alphabet F is a Davenport-Schinzel 
sequence of order s (a DS(s)-sequence for short) if no symbol appears on two consecutive 
positions and S does not contain the pattern abab ... of length s + 2. These sequences 
were introduced by Davenport and Schinzel [1] and found numerous applications in 
computational and combinatorial geometry. More can be found in the book of Sharir 
and Agarwal [21]. Let Xs{n) be the maximum length of a Davenport-Schinzel sequence 
over n symbols. The following are the current best bounds on Xs{n). 
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2na{n) - 0{n) < X^i^n) < 2na{n) + O \^n^/a{n)j , 

n ■ 2(lA0-(n)'-O(a(n)-^) < ^^^^^^^^ < ^ . 2(lA!)«(n)*+0(a(n)*-) ^ > ^ 

n ■ 2(lA!)"(n)*-0(a(n)*-) ^ ;^2^^3(^) < ^ . 2(lA!)-(n)* log, a(n)+0(a(n)*) fo^ t > 1. 

The upper bound on A3 is by Klazar [Hj, the lower bounds on for s > 3 are by 
Agarwal, Sharir and Shor [T] and all the other bounds were proved by Nivasch |il7j . 
Pettie [22] recently announced the following improved bounds: 

(](na(n)2"(")) < Xsin) < 0(na2(n)2"(")) and 

X2t+3in) < n ■ 2(iA!)-(n)'(i+o(i)) fo^ t > 2. 

Our proofs are based on several results on Davenport-Schinzel sequences as well as 
on sequences with other forbidden patterns. The results on sequences that we use are 
mentioned in more detail in Sections I2.H 12.21 and 13.11 where they are transformed into 
claims about matrices with forbidden patterns. 

An s -partition of the rows of an m x n matrix M is a partition of the interval 
of integers {1, . . . , m} into s intervals {1 = mi, . . . , m2 — 1}, {^2, . . . , — 1}, . . . , 
{rris, . . . ,m = rris+i — 1}. A matrix M contains a B-fat (r, s)-formation if there exists 
an s-partition of the rows and an r-tuple of columns each of which has B 1-entries in 
each interval of rows. Note that the order of the columns in the matrix is not important 
for this notion. See Fig. [T] for an example of a 1-fat (3, 4)-formation. In Section 12. 2^ 
we prove the following lemma, which gives an upper bound on the number of 1-entries 
an n X n matrix can have and still not contain any 5-fat (S, s)-formation. It is used 
in the proof of Theorem 11.11 in Section 12.31 analogously to the use of Raz's Technical 
Lemma [23] . 

Lemma 1.4. For all positive integers s,n and B, an n x n matrix M with at least 
(s{n)Bn 1-entries contains a B-fat {B , s)- formation, where Cs(^) ^^i"^ functions of the 
form 

C2(n) = 0(l), C3H = 0(«H), C4H = 0(«(n)2), CsW = 0(«H2"(")), 
C2t+3in) = 2(iA!)"(n)'+o(a(n)*-i) for t > 2 and 
C2t+4(n) = 2(i/*')"(")' i°g(°W)+o("(n)') fort>l. 

More generally, for all positive integers m, n, s and B, an m x n matrix M with at least 
(s{m)Bn 1-entries contains a B-fat {[nB /m\, s)-formation. 

The proof of the lemma is based on a proof of the upper bound on the number of 
symbols in the so-called formation-free sequences (see definition in Section 12. ip from 
Nivasch's paper [T7] . 

By an argument similar to the proof of P2{n) < 4n — 4 above, it is easy to verify that 
every mxn matrix M with at least 3n 1-entries contains a 1-fat ( [n/m] , 3)-formation. 
A similar result for 2-fat formations would slightly improve the upper bounds on r3(?T,) 
and ri{n). 

Problem 1.1. Does there exist a constant c such that for every m and n, every mxn 
matrix M with at least en 1-entries contains a 2-fat {\n/m\, 3) -formation? 

All logarithms in this paper are in base 2. 
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2 Upper bounds 



2.1 Numbers of 1-entries in matrices 

A sequence 5* of length I over an alphabet F is a function : [/] — )■ F. An (r, s)-formation 
is a sequence formed by s concatenated permutations of the same r-tuple of symbols. 
The permutations in a formation are its troops. A sequence S = (oi, . . . , a;) is r -sparse 
if 7^ aj whenever < |i — j| < r. An {r., s)- formation-free sequence is a sequence 
that is r-sparse and contains no (r, s)-formation as a subsequence. Let Fr^s{n) be the 
maximum length of an (r, s)-formation-free sequence over n symbols. Formation-free 
sequences were first studied by Klazar [TU]. 

To be able to use results on sequences for matrices, we use the matrix^ sequence 
transcription MST (our name) defined by Pettie [20j who improved an earlier transcrip- 
tion by Fiiredi and Hajnal [7j. The letters of the sequence correspond to the columns of 
the matrix. The matrix is transcribed row by row from top to bottom. Let Seqj_i be 
the sequence created from the first i — 1 rows. We consider the set Ct of letters corre- 
sponding to the columns having a 1-entry in the row i. The letters in Ci are ordered in 
the order of the last appearance in Seqj_i; the one that appeared last in Seq^.^ is first 
and so on. The letters that did not appear in Seq^.i are ordered arbitrarily and placed 
after those that did appear. The ordered sequence Ci is then appended to Seqj_p The 
length of the resulting sequence MST(M) = Seq^ is equal to the number of 1-entries in 
M and the size of the alphabet is n. Note that the previous papers ([3 transcribe 
the matrices column by column instead of row by row. 

A block in a sequence is a contiguous subsequence containing only distinct symbols. 
Note that MST(M) can be decomposed into m or fewer blocks. 

A set 5* of rs 1-entries forms an (r, s) -formation in M if there exists an s-partition 
of the rows and an r-tuple of columns each of which has a 1-entry of 5* in every interval 
of rows of the partition. See Fig. [H In this and all other figures, circles and full circles 
represent the 1-entries and empty space represents the 0-entries. A matrix M is {r,s)- 
formation-free if it contains no (r, s)-formation. 



Figure 1: A (3, 4)-formation on columns ji, j2 and J3. Full circles represent the 1-entries 
of the formation. Empty circles represent 1-entries outside of this formation. 

Lemma 2.1. A {0,l)-matrix M contains an {r, s)- formation if and only if MST (M) 
contains an (r, s) -formation. 



Proof. Observe that an (r, s)-formation in a matrix M implies an (r, s)-formation in 
MST(M). 



Jl 



J2 Ji 
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The proof of the other direction is more comphcated, because symbols of one block of 
MST(M) may be present in two troops of the (r, s)-formation in MST(M). To overcome 
this complication, we consider such an (r, s)-formation in MST(M), whose each troop 
ends earliest possible. Assume that the i-th troop ends with an occurrence of a symbol 
a in the j-th block of MST(M) and that the {i + l)-st troop begins with b from the j-th 
block. Since a precedes b in the j-th block, we know, by the definition of MST(M), that 
a appears somewhere between the occurrences of b and a of the i-th troop. Therefore, 
the i-th troop could end earlier, contradicting the selection of the (r, s)-formation. □ 

Nivasch gives the following upper bound on the maximum length Fr^s{n) of an (r, s)- 
f or mat ion-free sequence on n symbols: 

Theorem 2.2. ([171 Theorem 1.3]) For every r G N 



Let Pfe(^) be the maximum number of 1-entries in an (A; + 1, A; + l)-formation-free 
n X n matrix. Theorem 12.21 implies the following upper bounds on p'k{n). 

Lemma 2.3. We have 



Proof. Take a {k + l,k + l)-formation-free matrix M. Then MST(M) does not contain 
any [k + 1, k + l)-formation by Lemma [2. 1[ 

The sequence MST(M) = ai,a2, ■ ■ ■ ,ap can be made {k + l)-sparse by removing 
at most kn occurrences of symbols. Indeed, whenever two occurrences ai,aj (where 
i < j) of the same symbol appear at distance at most k, then is among the last k 
symbols preceding the block containing aj. Thus, it suffices to take the blocks from left 
to right and in each of them remove the at most k symbols that appear as the last k 
symbols preceding the block. The resulting sequence is thus a {k + l,k + l)-formation- 
free sequence of length differing by 0{n) from the number of 1-entries of M. The result 
then follows from Theorem 12.21 □ 

This proves the upper bounds in Theorem 11.31 by observing that a (fc + l)-tuple of 
columns with a {k + l,k + l)-formation contains every {k + l)-permutation matrix. 

Corollary 2.4. For every fixed k > 3 if we let t := [_{k — 2)/2j, then 
Psin) < 0{na{n)), 



Fr,4{n) < 0{na{n)). 




when s is even and 




Pk{n) < n2(i/*')°(") 
Pk{n) < n2(i/*')"(") 



I* log(a(n))+0(a(n)*) 



when k is odd and greater than 3 and 



i*+0(Q(n)«-i) 



when k is even. 
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2.2 Fat formations in matrices 

A sequence 5* is an AFF r,s,ki'm)-sequenc^ if it contains no (r, s)-formation as a subse- 
quence, can be decomposed into m or fewer blocks and each symbol of the sequence 
appears at least k times. Let 11^ ^^(m) be the maximum number of symbols in an 
AFFr_5^fc (m)-sequence. 

Let ad{m) be the cith function in the inverse Ackermann hierarchy. That is, ai{m) = 
\m/2], arf(l) = for d > 2 and ad{rn) = 1 + ad{ad-i{nT')) for rn,d > 2. The inverse 
Ackermann function is defined as a{m) := min{k : ak{m) < 3}. 

Nivasch defines a hierarchy of functions Rs{d), which we shift by 1 in the index. That 
is, our Rs{d) is the original Rs-i{d). We thus have the functions defined for s > 2 and 
d>2. The values are R2id) = 2, R3{d) = 3, Ri{d) = 2rf + 1, i?,(2) = 2^-^ + 1 and 

Rs{d) = 2{Rs.i{d) - 1) + {Rs^2id) - - 1) - 3) + 1 when s > 5 and d > 3. 

For s > 5, if we let t = [(s - 3)/2j, then Rs{d) = 2(Vt!)°!* iog(d)+o(d') jf ^ jg ^^^^ ^-^^ 
Rs{d) = 2(Vt!)°!*+o('i'-^) when s is odd. 

Lemma 2.5. ( pTl Corollary 5.14]) For every d > 2, s > 3, r > 2, m and k satisfying 
m > k > Rs{d) we have 

where c(, is a constant depending only on s. 

The linear dependence of the upper bound on r is not explicitly mentioned in |17j . 
but can be revealed from the proof. In the base case, the dependence on r is linear 
(Lemmas 5.9 and 5.10 in [T7j) and in Recurrences 5.11 and 5.13, the right-hand side can 
be rewritten as r times an expression not depending on r. 

It was shown [131 that doubling letters in the forbidden subsequence usually 
has small impact on the maximum length of a generalized DS-sequence (see the defini- 
tion in [in]). Geneson [8] generalized the linear upper bound from the Fiiredi-Hajnal 
conjecture to forbidden double permutation matrices. We show a similar behavior of 
formation-free sequences and matrices. For s > 2, a set 5* of r{2s — 2) 1-entries forms 
a doubled [r, s)- formation in M if there exists an s-partition of the rows and an r- 
tuple of columns each of which has one 1-entry of S in the top and bottom interval of 
rows of the partition and two 1-entries in every other interval. A matrix M is doubled 
{r, s)- formation- free if it contains no doubled (r, s)-formation. A DFFr,s,fc (m)-mainx 
is a doubled (r, s)-formation-free matrix with m rows and at least k 1-entries in every 
column. Let A^ ^ ^(m) be the maximum number of columns in a DFF^ fc(m)-matrix. 

In Corollary 12 . 1 21 we show an analogue of Lemma 12. 5l for doubled (r, s)-formation-free 
matrices. The proof follows the structure of the proof of Corollary 5.14 in [TT]. First, 
we show some simple bounds on Ar,s,fc(m). The case = 2 of Corollary 12. 121 is proved in 
Corollary 12.101 by Recurrence 12.91 and the remaining cases follow from Recurrence 12.111 
Corollary 12. 121 will give a sequence of upper bounds on Ar,s,A:(^)- Typically, the bounds 
are superlinear in m for r, s and k fixed and the subsequence of bounds applicable is 
limited by the values of s and k. As k grows (keeping r and s fixed) the best applicable 

^ AFF is an abbreviation for almost-formation-free. 



7 



bound gets closer and closer to linear. When one lets be a suitable function of a;(m), 
the bound becomes linear in m. 

li m < k, no matrix with m rows can have k I's in every column. 

Observation 2.6. For every r,s,k,m, if m <k, then 

Ar,s,k{'m) = 0. 

Observation 2.7. For every r, s, k, m, if k < 2s — 2, then 

Ar,s,k{m) = oo. 

Analogously to [T71 Lemma 5.10], all the other values of Aj.^s yfc(m) are finite. 
Observation 2.8. For every r > 2, s > 2 and m > 2s — 2 

Ar,.,2s-2(m) < (r - 1) f ^ _ ^ j < rm'-\ 

Proof. If each column in an r-tuple of columns has the same position of the 2nd, 4th, 
. . . , (2s — 2)nd 1-entry, then the first 2s — 2 1-entries from the columns form a doubled 
(r, s)-formation. □ 

Recurrence 2.9. For every r, k, m and s > 3 

A,.,s,2fc+i(2m) < 2A,.^,2fc+i("^) + 2A^,s_i,fc(m). 

Proof. As in the proof of [T71 Recurrence 5.11], we cut the rows of a DFFj. 2fc+i(2rn)- 
matrix into the upper m rows and the lower m rows. The local columns are those with 
all 1-entries in the same half of rows. There are at most 2Ar.s,2A:+i(^) local columns. 
Columns that are not local are global. Consider the submatrix M[ formed by the upper 
half of rows of global columns with at least half of their I's in the upper half of rows. Let 
Ml be the matrix created from M[ by removing the lowest 1 in every column of M[. If 
Ml contains a doubled (r, s — l)-formation, then M contains a doubled (r, s)-formation. 
Thus Ml has at most Ar^s-i^kiiT^) columns. A symmetric argument can be applied on 
the global columns with at least half of their I's in the lower half of rows. □ 

Corollary 2.10. For every fixed s > 2 and for all integers r, /c, m satisfying k > 2^^^ + 
2"*^^ — 1 we have 

Ar,s,k{m) < Qrmlog(m)^"^, 
where Cg is a constant depending only on s. 

Proof. The proof proceeds by induction on s and m. The base case of s = 2 follows 
from Observation 12.81 and the cases with m < k from Observation 12.61 Recurrence 12.91 
is used as the induction step. □ 

Recurrence 2.11. For every nonnegative r,m, ki, k2, k^, k^ and t satisfying m > t, 
ki > k2 + l > 2 and ki>k^> 3, if we let k = 2ki + (/c2 + 1) (^3 ~ 3) + {k^ — ^3) + 1, then 

A,,,,fc(m) < (^1 + - j (A,,,,fc(t) + 2Ar,s-i,kM + A,,,_2,fe(t)) + 

+ n;^,^fc3 + Ar,sM (1 + j) fors>A and 

(Tfi \ 
1 + jj {Ar,s,k{t) + 2A,,,_i,fc,(t) + r - 1) + 



+ KsM, ( 1 + 7 j + ^-'^''^4 (1 + y j for s 
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Proof. Consider a DFFr.s,fc(^)-niatrix M. We partition the rows of M into b := [m/t] < 
m/t + 1 layers Li, . . . , L;, of at most t consecutive rows each. 
A column is 

• local in layer Lj if all its I's appear in layer Li, 

• top- concentrated in layer Lj if it has at least ki + 1 I's in layer Lj and at least one 
1-entry below Lj, 

• bottom- concentrated in layer Li if it has at least fci + 1 I's in layer Li and at least 
one 1-entry above Lj, 

• middle- concentrated in layer Li if it has at least ^2 + 2 I's in layer Li and at least 
one 1-entry above and one below layer Li, 

• doubly-scattered if it has at least two I's in at least layers, 

• scattered if it has a 1-entry in at least layers. 

These categories are analogous to those used by Nivasch [17j, except that we added 
the category of doubly-scattered columns. This allows us to use 11' instead of A in 
one summand of the recurrence. As one of the consequences, when s > 6, the upper 
bound on the maximum number of I's in a doubled (r, s)-formation-free nx n matrix 
in Lemma [1.41 is similar to the best known upper bound on .;(n), although it is closer 
to Fr^s+i{n) when s = 3. 

Every column falls into one of these categories. If a column is in none of them, then 
its number of I's is maximized when it has ki I's in its top and bottom nonzero layers, 
k2 + 1 I's in some other k^ — 3 layers and a single 1 in some additional ^4 — k^ layers. 
Thus it contains only at most 2ki + (^2 + 1)(^3 ~ 3) + (^4 — ^3) < k — 1 1-entries. 

For each layer Li, the number of columns local in Li is at most Ar^s,k{t)- For ev- 
ery fixed i we consider the columns that are top-concentrated in Li and let be the 
submatrix of M defined by these columns and the rows of Li. Let Mj be obtained 
from Ml by removing the lowest 1-entry from every column. If Mj contains a doubled 
(r, s — l)-formation, then M contains a doubled (r, s)-formation. Thus there are at most 
Ar.,s-i,fci(t) columns top-concentrated in Li. Similarly, there are at most Ar^s-i,kii^) 
columns bottom-concentrated in Lj. For s > 4, there at most Ar^s-2,k2{t) columns 
middle-concentrated in Lj. For s = 3, there are at most r — 1 columns middle concen- 
trated in Li, because an r-tuple of columns with two I's in layer Li and at least one 1 
above and one below contains a doubled (r, 3)-formation. 

To bound the number of doubly-scattered columns, we contract each layer into a 
single row. That is, we write 1 for every column containing at least two I's in the 
layer and otherwise. If there is an (r, s)-formation on the contracted doubly-scattered 
columns, then M contains a doubled (r, s)-formation. Thus, by Lemma 12.11 there are 
at most fcair^^/'^l) doubly-scattered columns. By a similar argument, the number of 
scattered columns is at most Ar,s,fc4( [m/t] ). The only difference is that while contracting, 
we write 1 for the columns containing at least one 1 in the layer. □ 
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Similarly to Nivasch's functions Rs{d), we define a hierarchy of functions Ds{d), 
where s > 1 and > 2, as follows: Di{d) = 0, D2id) = 2, Ds{2) = 2''^ + 2'-^ - 1 and 
when s,d> 3 

Ds{d) = 2Ds-i{d) + {Ds-2{d) + l){Rs{d - 1) - 3) + Ds{d - 1) - Rs{d - 1) + 1. 
Then 

D^{d) = 2rf + 1, Di{d) < 0{d^), D^{d) < 0{d2'^), 
I)2m(c^) < 2(^/*')"'+°("'") fort>2and D2t+Aid) < 2^'/''-^^''°^^''^+''^'''^ for t > 1. 

Corollary 2.12. For every d > 2, s > 2, r > 2, m and k satisfying m > k > Ds{d) we 
have 

Ar^s,k{^) < Csrmad{my~'^ , 
where Cg is a constant depending only on s. 

Proof. The proof proceeds by induction on d, s and m similarly to the proof of [TTJ 
Corollary 4.12]. In the case s = 2, we apply Observation 12.81 and so the lemma holds 
with C2 = 1. For every s > 3 let mo{s) be a constant such that 

m > 1 + (6s)'^ \\og2{m)Y for every m > mo(s). 

Let Ci = C2 = 1 and for s > 3 we define in the order of increasing s as 

Cs := max{4, c^, 9Cs-i, 9Cs-2, mo{sy~^}, 

where Cg is the constant from Corollary 12.101 and c'g is the constant from Lemma [2751 For 
every s > 3 and d > 2, we define a function a^^s by d2,s{fn) = Rog("^)l, Cid,s{^) = 1 if 
m < mo{s) and 

ad,s{'rn) = 1 + ad,siQsad-i,s{my~'^) otherwise. 

Then Q;d_s(m) is well defined and differs by at most an additive constant (depending on s) 
from the values of the cith inverse Ackermann function ad{m) for all s, c? and m (this can 
be shown similarly to [IZl Appendix C]). The functions also satisfy dd,s{fn) > ttd,s-i('Ti). 
It is thus enough to prove 

Ar^s^kim) <Csrmdd,s{rnY''^ . 

The case d = 2 follows from Corollary 12.101 The cases m < mo{s) follow from Observa- 
tion [231 Now s > 3, d > 3 and m > mo(s). We apply Recurrence 12.111 with: 

fci = Ds-i{d), k2 = Ds-2id), k, = R,{d - 1), 

ki = Ds{d — 1), k = Ds{d) and t = 6sQ;(i_i_s(m)'^~^. 
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By the induction hypothesis, 



2^r,s-i,kA't) + A,,,_2,fc2(t) < r^tad,s{my-^ when s > 4, 

2^r,s-i,kA^) + - 1 < r^tad,sijny~^ when s = 3, 

2m 



Ar-,s,fc4 (^1 + y j < Csr—ad-i,s[m) < r— < r—mad,s{m) for s > 3 



and by Lemma [2.51 

TT/ f ^\ ^ 2m _ o Cs Cs _ , ,„_o 

^r,sM + y) - ^^^s—ad^iA^) < r—m < r—mad,s{fn) . 

Substituting into Recurrence 12.111 we get 

K,s,k{^) < —^r,s,kit) + ^r,s,kit) + (m + t)r^ad,sijny~'^ + -^rmad,sijny~'^ 

< jAr,s,kit) + ^rmarf,,(m)^~3 + A,,,,fc(t) + jrtad,s{my-^. 

By Observation 12. 8[ Ar,s,k(t) < rf^^^ < r{6sad-i,si^y^^'^)''^^^ , which is at most rm, 
because m > mo(s). So Ar,s,fc(t) < Csrm/d. Similarly ta^^sl"^)*"^ < m/3. Thus 

Ar,^,fc(m) < '^Ar,s,k{t) + ^-^rmad,s{my~^ + ^rm + ^rm 

777- 

< —/\r,s,kit) +Csrmad,s{my~^ 



TTl 

— ~^sftcid,sity~'^ + Csrrnad^sijny~'^ by the induction hypothesis 

s-2 I ^ (^\s~?'\ 



< mCsT ■ ((arf,,(m) - 1)" ' + ad,simy ^) 

< Csrmad,s{i^)^~'^ ■ CD 

Let Psirn) := Ds{a{m)). 

Corollary 2.13. An m x n matrix M with at least /3s(m) 1-entries in every column 
contains a doubled ( \_{n — 1)/ {mc'g)\ , s)-formation, where c's is a constant depending only 
on s. 

Proof. Let c's = Cs3^~^, where Cg is the constant from Corollary 12.121 and let r = [{n — 
1) / {mc's)\ . If M did not contain a doubled (r, s)-formation, its number of columns would 
be, by Corollary 12.121 with d = a(m), at most 

Csrmaa(m){^y~'^ < rmCs3'^~^ = [{n — l)/(mc^)Jmc^ < n. □ 

A set S of Brs 1-entries forms a B-fat {r, s)- formation in M if there exists an s- 
partition of the rows and an r-tuple of columns each of which has B 1-entries of S in 
each interval. A matrix M is B-fat {r, s)- formation- free if it contains no i?-fat {r,s)- 
formation. 



We now prove a more precise version of Lemma 11.41 
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Lemma 2.14. For all positive integers m, n, s and B, an mx n matrix M with at least 
2(/3s(m) + 2)Bn 1-entries contains a B-fat {[nB /{mcs)\, s)- formation, where Cg is a 
constant depending only on s. 

Proof. We transform the given matrix M to a matrix M with the same number of 1- 
entries in every column using the idea from the proof of Lemma 4.1 from [T7]. Let v{q) 
be the number of 1-entries in a column q of M. In every column q, we split the 1-entries 
into chunks of consecutive {(is{m) + 2)B I's. The last less than {Ps{m) + 2)B I's are 
discarded. Each of the chunks gets its own column with 1-entries in the rows where the 
1-entries of the chunk lie. These columns form the matrix M. Note that the order in 
which the columns are placed to M is not important. Because we discarded at most 
{I3s{m) + 2)Bn I's and every column of M has exactly + 2)B I's, M has at least 

n columns. Observe that for every r and s if M contains a B-iat (r, s)-formation, then 
so does M. 

We consider only the first n columns of M. We also remove at most B — 1 rows so 
as to have the number of rows divisible by B. We still have at least (/3s(m) + 1)1? I's 
in every column. In each column q, we select a set S of I's such that none of them is 
among the first or the last B — 1 I's of the column q and there are at least B — 1 I's 
between every two I's of 5*. We take S of size f3s{m) and remove all the other I's in q. 
The rows of M are now grouped into intervals of rows {iB + 1, . . . , (z + 1)B}. By the 
choice of S, every column contains at most one 1-entry in every interval. We obtain M 
by contracting each of the intervals of rows into a single row. 

The matrix M has \rn/B\ rows and n columns, each of them having Ps{i^) I's- It 
thus contains a doubled ( [(n— 1)5/ {mc'g)\ , s)-formation by Corollary 12. 131 By the choice 
of S, this implies that M and consequently M contain a B-fat ([(n — l)B/{mc'g)\,s)- 
formation. □ 



Proof of Lemma L4 Let C 



[m] 



2{f3s{m) + 2)max{l,Cs}, where Cg is the constant 
from Lemma 12.141 Let M be an m x n matrix with at least (s{m)Bn 1-entries. By 
Lemma [2. 141 M contains a B-fai ( [ni^/mj , s)-formation. □ 



2.3 Sets of permutations with bounded VC-dimension 

In this section we prove Theorem 11.11 It will be more convenient for the proof to 
substitute the permutations by their corresponding permutation matrices. That is, we 
have a set V of n-permutation matrices and for every {k + l)-tuple (oi, . . . , a^+i) of 
columns, there is a forbidden {k + l)-permutation matrix Sai^...^ak+i- 

Let M-p be a (0, l)-matrix with 1-entries on the positions where at least one matrix 
from V has a 1-entry. Let \M\ denote the number of 1-entries in a (0, l)-matrix M 
and let v{V) = v{Mp) = \M'p\/n. Similarly to Raz's proof of the exponential upper 
bound on r2{n) |23], we will remove matrices from V until we decrease v{V) below some 
threshold T{n). When v{V) < T{n), then \V\ < T{n)"' since the number of permutation 
matrices contained in is bounded from above by the maximum of a product of n 
numbers with sum v{V)n. 

Let 7fe(n) = 2{k + l)\(k^i{n), where Ck+i{n) is the function from Lemma IT^ 
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Lemma 2.15. Let V be a set of n-permutation matrices with VC- dimension k such that 
v(V) > 27A;(n). Then there is a set V CV satisfying 

viV') < viV) - 4^ 



\V'\ > 



%{n)n 



Proof. Let B := \y{V)/(k+i{'n)\. By Lemma ll.4[ the matrix M-p contains a i?-fat 
{B, k + l)-formation. Let C be the set of the B columns of the formation and let 
71 = {Ri, i?2, • • • , -Rfc+i} be the intervals of rows of the formation. 

Consider some t-tuple Q = {qi, ... ,qt} of columns from C, where t < k + 1. Let Zq 
be the set of all injective functions I : Q ^ TZ assigning the intervals Rj to the columns 
qi. We say that a permutation matrix P obeys I E Iq if for every i E {1,2, . . . , t}, the 
1-entry of P in the column lies in some row of I{qi). For each I eXq let Vi be the set 
of matrices P eV that obey /. The t-tuple Q of columns is said to be criss-crossed if 

VJgXq : \Vi\ > \V\/v{Vy. 

Suppose that some {k + l)-tuple of columns from C is criss-crossed. Then every 
{k + l)-permutation appears as a restriction of some matrix from V on the criss-crossed 
{k + l)-tuple of columns. Hence the VC-dimension of V is at least k + 1. 

Consequently, there is some t such that < t < A; and the largest criss-crossed set Q 
of columns from C has size t. This means that for every column u outside Q, we can find 
an injective function Ju E XQu{n} such that \VjJ < \V\/v(Vy~^^. On the other hand, if 
we restrict J„ on Q, the resulting function /„ := Ju \ Q satisfies \Vi^\ > \V\/v{Vy. To 
each choice of u E C\Q, we assign the function E Xq and the interval Ju{u) of rows. 
Some function I E Xq was then assigned to at least (|C| — A;)/|Xq| columns. Because 
B>A{k + 1)!, we have 

> B-k ^ v{V) ^ v{V) 



\c 




k 









(fc + l)! - 2a+i(n)(fc + l)! - 7fc(n)- 

Let 7/ be the set of some [f (P)/7fe(r;,)] columns that were assigned the function /. 
Because f ("P) > 27^(72), we have 

< ir,| < < ^. (2,1) 

lk\n) 7fc(n) 2 

For each column qi E Q , we remove from M all 1-entries in the column q^ except 
those that lie in the rows of I{qi). This reduces the number of permutation matrices, 
but there are still at least \'P\/v{Vy of them. Then we remove from M the 1-entries 
in each column u E Ti that lie in the set of rows Ju{u)- See Fig. [2J Thus we removed 
at least B 1-entries from each of these columns. The removed 1-entries of each of these 
columns decreased the number of permutation matrices by at most \Vj^\ < \V\/v{Vy~^^. 

Let V G V he the set of permutation matrices containing none of the removed 
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Figure 2: A criss-crossed set Q of 3 columns and a set 7} of columns {ui, . . . , u^}, where 
I{qi) = R4, I{q2) = Ri, His) = R2, JuAui) = J„2(m2) = R3 and Junius) = JuMi) = 
Ju^iuh) = Rh- The 1-entries from the crossed rectangles are removed. 

1-entries. Using the bounds from Equation fl2.ip . we obtain 

|Mp,| < \Mr\ - B\Ti\ < nv{V) - 

W \ > - — — > — — — > — — —. □ 

' ' - v{vy v(vy+^ - 2vivy - 2v{vy 

Proof of Theorem \l.li Let P be a set of permutation matrices with VC-dimension k. 
We will bound its size by iteratively applying Lemma 12.151 Let Vq = V and for j > 1 
let Vj be the V given by the lemma applied on Vj-i. 

The iterations are further grouped into phases. Let 0o •= 0- Phase i ends after the 
first iteration 0j after which v{V^J < v{V^^_^) /2. Let Vi := v(Vfjj.). Then an iteration 
of phase i is applied on a set V of permutations satisfying 

^ < v{V) < v,.i (2.2) 

Further, let 

T:=7fc'Hlog(7fcH). (2.3) 
We end after the first phase I satisfying vi < 2T. We thus have 

|P^J < (2T)^ (2.4) 

For every i > 1 we have 

vi-i > TT. (2.5) 

We now count the number of iterations in phase i. By Lemma [2.151 and ( fl2.2p ). each 
of these iterations decreases v(V) by at least vf_^i/{4:'yl{n)n). Therefore the phase ends 
after at most [27|(n)n/f < 37^(n)n/fj_i iterations. Consequently 

I < \V^. I ■ (2t;f_i)'^'^"^"/''-^ by Lemma MB and (Q 
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and 



i=0 

< \V^^\2^^^U^^''^U^°^(^'T)/{rT) ^ gj^^g MfO is decreasing on [2T, oo) 

< (2T)" ■ 26'=T'(")"(2+log T)/T ^ 

< (27fc2(n) log(7fc(n)))" ■ 2^0'=" by (E^l). 

Since 7fc(n) G 0(Cfe+i(^)), we have 

r,in)< (O(a(n)^log(a(n))))", 
r4(n) < 2"-(2"(")+3iogW"))+o(i))^ 
r2t+2in) < 2"-((2A!)-(n)'+OKn)*-)) for t > 1 and 

r2t+3{n) < 2-((2A0-W*log{a{n))+O(a(n)*)) ^ > □ 

Remark. Let an ra-function be a total function / : [n] — t- [n]. We say that a set J-" 
of ra-functions has VC-dimension with respect to permutations (abbreviated as pVC- 
dimension) /c if /c is the largest integer such that the set of restrictions of the functions 
in J-" to some fc-tuple of elements from [n] contains all fc-permutations. Let r^(n) be the 
size of the largest set of n-functions with pVC-dimension k. Observe that the proofs of 
this section never use the fact that the matrices in V have exactly one 1-entry in every 
row. Thus the upper bound from Theorem 1 1 . 1 1 also holds with r'^{n) in place of rk{n). 



3 Lower bounds 

3.1 Matrices from sequences 

Let DSs be the s x 2 matrix with 1-entry in the ith row and jth column exactly when 
z + j is odd. For example 




Based on a construction of Davenport-Schinzel sequences of order 3 and length Q{na{n)) 
by Hart and Sharir [9j, Fiiredi and Hajnal |7j constructed n x n DS'4-avoiding matrices 
An with Q{na{n)) 1-entries. We will use a different construction of DS(s)-sequences 
of orders s = 3 and all even s > 4 by Nivasch [17] that together with the following 
transcription will provide us with DS's+i-avoiding matrices with the additional property 
of having the same number of 1-entries in every column. 

Let be a sequence over n symbols that can be partitioned into m blocks. Recall 
that each block contains only distinct symbols. We number the symbols l,...,n in 
the increasing order of their first appearance. The sequence^matrix transcription of S, 
SMT(S'), is the mxn {0, l)-matrix with a 1-entry in the ith row and jth column exactly 
if the ith block in the sequence contains the symbol j. 
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Observation 3.1. ([7]) If S is a sequence avoiding the alternating pattern aba... of 
length s + 2, then SMT(S') avoids DSs+i- 

Proof. If SMT(S') contains DSs+i, tlien S contains the alternating sequence ba . . . of 
length s + 1 for some a < b. By the numbering of the symbols, the symbol a appears 
before the first occurrence of b, therefore 5* contains aba ... of length s + 2 and thus S 
is not a DS(s)-sequence. □ 

Lemma 3.2. For every n there exists an n x n DS^-avoiding matrix Mn with at least 
2a{n) — 0(1) 1-entries in every column. 

Proof. Let Ad{x) be the cith function of the Ackermann hierarchy. We refer the reader 
to Nivasch's paper [17] for the definition. Let A{x) = Ax{3) be the Ackermann function. 

In Section 6 of [T7], Nivasch constructs for every d, m > 1 an ababa-free sequence 
Zd{m). We use the sequences Z'^ = Zd{Sd + 4) which have the following properties: 

• Each symbol appears exactly 2d + 1 times. 

• The sequence can be decomposed into blocks of average length at least id + 2 
(by [IZl Lemma 6.2]). 

• The number A''^ of symbols of the sequence is at most Ad{Sd + 4 + c) (by [T71 
Lemma 6.2]), where c is an absolute constant. 

Let Md be the number of blocks of Z'^. By counting the length of Z'^ in two ways, 
{2d + l)Nd > (M + 2)Md and thus A'^ > 2Mrf. By the analysis before Equation (35) 
in |T7], there is some do such that for d > d^ we have 

Nd < Ad{8d + 4 + c) < AdiA{d + 1)) = A{d + 2) and so 
a{Nd) <d + 2. 

Then SMT(Z^) is an Md x A^^ matrix with 2d + 1 > 2a{Nd) — 3 1-entries in every 
column. By Observation 13. 1[ SMT(Z^) avoids DS4. We construct the matrix M^^ by 
adding empty rows to SMT(Z^). 

For values n > Nd^ different from Nd, we proceed similarly to the method in Section 6 
of [17]: We consider the largest A^^ smaller than n and take In/Nd] copies of SMT(Z^). 
We place the copies into a single matrix so that each copy has its own set of consecutive 
rows and columns. After removing at most half of the columns, we obtain a matrix with 
exactly n columns and at most n rows. The matrix has at least 2d + l > 2a{Nd+i) — 5 > 
2a{n) — 5 1-entries in every column. The construction of M„ is then finished by adding 
empty rows to obtain a square matrix. □ 

Lemma 3.3. For every t > 1 and n there exists an n x n D 821+3- avoiding "matrix with 
at least 2^^/*'^"^"^ -O(o{n)* ) x.gj^^j-^gg in ^y^ry column. In particular, 

exz,5..+3(n)>n2(i/*')"(")*-«("(")'-^). 

Proof. Let s := 2t + 2. Since s is even and s > 4, we can use Nivasch's construction [TTf 

/ * \ 

Section 7] of DS(s)-sequences Sl{m) with parameters k,m > 2. Let Hs^k) := 2'^(«-2)/2/. 
We use the sequences S'^ = S'|(2/is(fc)), which have the following properties: 
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• Each symbol of S*^ ^ appears exactly fis{k) times (by [IZl Equation (47)]). 

• The sequence can be decomposed into blocks of length 2ns{k). 

• For every k > ko{s), where ko{s) is a properly chosen constant, the number A^<, ^ 
of symbols of the sequence satisfies a{Ns^k) < k + 3 (by [T71 Equations (50), (51)] 
and analysis similar to the one in the proof of Lemma [3.2p . 

Let Mg^k be the number of blocks of S'^j^. It satisfies 2Ms ^ < Ng^k. The matrix 
SMT(S'^^) is a DSs+i-avoiding M^^^ x A^^ ^ matrix with at least fig^k 1-entries in every 
column. For every n > Nsfy^(s) we take the largest k such that n > Ng^k and proceed in 
the same way as in the proof of Lemma 13^ with SMT(S'^^) in the place of SMT(Z^). 
We have 

k > a{Ng^k+i) - 4 > a{n) - 4 
and so the number of 1-entries in every column of the resulting matrix is 

^^(^^) ^ 2((-2)/2) = 2(t) > 2(Vt!)fe'-o(fc*-i) > 2(iA!)"(")*-o("(n)'-^). □ 

Remark. We could also use the construction of DS'4-avoiding matrices with Q{na{n)) 
1-entries by Fiiredi and Hajnal [7]. The matrices do not have the same number of 1- 
entries in every column, but it can be shown that every column has at most constant 
multiple of the average number of 1-entries per column. This would be enough for our 
purposes. The base case of the inductive construction in [7] needs a small fix. The 
matrices M{s, 1) and M(l, s) do not satisfy conditions imposed on them. This can be 
fixed for example by taking ( J 1 ) for M{s, 1) and the matrix with the leftmost column 
full of 1-entries and with no 1-entries in the other columns for M(l, s). 

3.2 Numbers of 1-entries in matrices 

A matrix is k-full if some fc-tuple of its columns contains every /c-permutation matrix. 
The fullness of a matrix A is the largest k such that A is fc-full. In this section we 
show a lower bound on the maximum number Pk{n) of 1-entries in an x matrix with 
fullness k. This is achieved by showing that a fc-full matrix contains the matrix DSk 
and applying the results from Section 13.11 We prove a slightly stronger statement that 
will be used in the next section. 

Let J2 := (. *). For an /-permutation matrix P, we define the J2-expansion of P, 
P"^^, to be the 21 x 21 permutation matrix created by substituting every 1-entry of P by 
J2 and every 0-entry by a 2 x 2 block full of 0-entries. 

A pair of rows 2i, 2i + 1 of P'^^ will be called contractible if the 1-entry in row 2i is 
to the left of the 1-entry in row 2z + 1. That is, when 7r~-'^(z) < 7r~^(z + 1), where vr is 
the permutation corresponding to P. To contract a pair of rows means to replace them 
by a single row with 1-entries in the columns where at least one of the two original rows 
had a 1-entry. 

Let an (n, m)-function be a total function f : [n] ^ [m]. A function matrix is a (0, 1)- 
matrix with exactly one 1-entry in every column. Assigning to a function / a function 
matrix Gf with Gf{i,j) = 1 <;=^ f{j) = i provides a bijection between (ra, m) -functions 
and m X n function matrices. 
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The set of J2- expansion flattenings of P is the set J^{P^^) of function matrices that 
can be obtained from P-^^ by contracting some pairs of contractible rows. Let Vi be the 
set of /-permutation matrices and let 



$(/) := {^(P^^) : P G Vi}. 



For example 



*(2)^{{(..--)},{C 



}}■ 



Lemma 3.4. If an n x 21 matrix A contains one matrix from J-'(P'^^) for every l- 
permutation matrix P, then A contains an occurrence of DS21 on columns {2i — l,2i} 
for some i E [I]. 

Proof. We proceed by induction on /. The case / = 1 is trivial since $(/) = {{</2}} = 

The ith pair of columns of A is the pair of columns {2i — 1, 2i}. For each z < / let hi 
be the smallest number such that the ith pair of columns of A contains J2 on a subset 
of rows {1 ... hi}. Let t be the largest number satisfying Vz hi < ht. Let A^* be the 
{n — ht + 1) X 2(Z — 1) matrix obtained from A by removing the columns of the tth pair, 
removing the top ht — 1 rows and then changing all 1-entries among the first 2{t — 1) 
entries in the first row to O's. See Fig. [3] 

1st 

pair 2rid 3rd 4th 




• • • 

• • 



Figure 3: Induction step in the proof of Lemma [3.41 In this example t = 3. 

For every P with the topmost 1-entry in column t, A contains an occurrence of some 
F G J-'(P''2), that uses the two 1-entries of the topmost occurrence of J2 on the tth 
pair of columns. These occurrences induce an occurrence of some matrix from every set 
J-" G $(/ — 1) in A^*. By the induction hypothesis, A^* contains DS2{i-i) on some ith 
pair of columns. By the choice of t, this occurrence of DS2(i-i) in A does not use any of 
the rows {1 . . . hi}. Thus we obtain an occurrence of -05*2; in A. □ 

For an /-permutation matrix P and i < 2/ + 1 we define P-^^li) to be the {21 + 1)- 
permutation matrix that becomes P"^^ after removing the lowest row and column i. Then 
J-'{P'^'^,i) is the set of function matrices that can be obtained from P^'^{i) by contracting 
some pairs of contractible rows. For example 
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Let 

:= {J'{P'\i):P eVi}. 

Lemma 3.5. Let A he an n ^ {21 + 1) matrix and let A' be the matrix obtained from A 
by removing the last 1- entry from each column. If A' contains one matrix from J-'{P'^^,i) 
for every l-permutation matrix P and every i G [2/ + 1], then A contains D 821+1- 

Proof. For each i < 2/ + 1 let di be the row number of the lowest 1-entry in the ith 
column of A' . Let t be any of the rows satisfying Vz di > dt. Let A^* be the dt x 21 
matrix obtained from A by removing the tth column and all rows below the rf^th row. 
Then A^* contains one matrix from every J-" G therefore by Lemma [3.41 the matrix 
A^* contains an occurrence of DS2i- By the choice oft, the matrix A contains DS2i+i- □ 

Corollary 3.6. For every k > 1 

Vk{n) > exBSfe+i(«)- 

Proof. When k is even, the result follows from Lemma 1331 since for every /-permutation 
matrix P and for every i G [2/ + 1], the set J^{P'^^,i) contains some {21 + l)-permutation 
matrix, namely the matrix without any row contractions. The result for k odd follows 
from Lemma [3.41 □ 

The row contractions did not play any role in the proof of Corollary 13. 6[ but they 
will play a role in Section 13.31 below. 

Corollary 3.7. We have 

P3{n) > 2na{n) — 0{n), 

Pk{n) > n2(i/*')"(")*-°("(")'"') for k>A, 

where t := [{k - 2)/2j. 

Proof. The lower bound for = 3 is by Lemma 13^ and Corollary 13. 61 and from Lemma [3l3l 
and Corollary 13.61 when k is even and k > 3. When k is odd and /c > 5, we use 
Pk{n)>Pk-i{n). □ 

3.3 Sets of permutations 

Proof of Theorem \1.2[ Given k and n, we take the DS k+l-^^/oiding n x n matrix Ak^n 
from Lemma [3.21 if A; = 3 or from Lemma [3.31 if > 4 is even. Let Pk{n) be the number 
of 1-entries that A^n has in every column, that is P2{n) = 2a{n) — 0(1) and for t > 1 

P2t+2{n) = 2(V*!)a(n)*-0(a(n)*-l)_ 

From Ak^n we construct a set of Pk{n)^ n x n function matrices by choosing some 
1-entry from each column. Then we remove all empty rows, which can make some 
originally different function matrices identical. However, the resulting set "H has size at 
least pfc(n)"/2" as there are at most 2" distinct ways to enlarge a function matrix by 
adding empty rows to a matrix with n rows. 

The last step is inflating the rows of the function matrices in Ti into diagonal matrices 
to obtain a set Q of ra-permutation matrices. That is, for every H E H, we order the 
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1-entries primarily by the rows from top to bottom and secondarily from left to right. 
The permutation matrix Q will have 1-entries on those positions such that H has 
its zth 1-entry in column j. The reverse process consists of contracting intervals of 
rows of a permutation matrix Q and we have at most 2" possibilities how to choose 
the intervals. Thus every permutation matrix can be created by expanding at most 2"^ 
different function matrices. The size of the set Q is 

' ' - 2"2" V 4 y ■ 

It remains to show that the VC-dimension of Q is at most (fc + 1). We assume for 
contradiction that for some {k + l)-tuple C of columns and every {k + l)-permutation 
matrix R there exists Q G Q that contains R on C. 

Consider some permutation matrix Q & Q and lei H E l-L be the function matrix 
from which Q was created. The matrix H can thus be constructed from Q by contracting 
some intervals of rows such that the restriction of Q on each of these intervals of rows is a 
diagonal matrix. So the only change that these contractions can make on an occurrence 
of P-^' in Q is that some pairs of its contractible rows can be contracted. Thus an 
occurrence of P"^^ in Q on the set C of columns can only be created from an occurrence 
of some F G T{P^'^) on C in H and in as well. Similarly, an occurrence of P^'^{i) 
on C in Q implies an occurrence of some matrix from T{P^'^,i) on C in H and Aj^^n- 
See Fig. H 



/ ° \ 
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Figure 4: Expansion of an occurrence of a matrix from J^lP"^^). 

Therefore for k > A even, for every (A;/2)-permutation matrix P and every i G + 
some matrix from J^{P^'^,i) occurs on C in Ak^n- Thus, by Lemma 13.51 A^^n contains 
DSk+i, a contradiction. Similarly if A; = 3, we get a contradiction by Lemma [3. 4[ □ 
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